An Implementation of Support Vector Machines and Generalized Discriminant Analysis on Iris and Hepatitis Datasets
نویسندگان
چکیده
Support Vector Machines (SVM) is a supervised learning method used for classification. The learning strategy of SVM is based on structural risk minimization principle, so SVM has a better ability to generalize than other methods which depend on empirical risk minimization principle. However, when any classification methods face a dataset which is linearity inseparable, they will face a difficulty to classify the dataset. This problem results in low classification rate averages. To anticipate this problem, it is desirable to use Generalized Discriminant Analysis (GDA) as feature extractor. We expect that using GDA will give a better classification rate averages because it can minimize the distances of data within the same classes and maximize the distance between the different classes. This paper presents a comparison of Support Vector Machines with and without using GDA for Iris and Hepatitis datasets classification. It is shown that the use of GDA can yield a classification rate averages of more than 93% for Iris dataset and 95% for Hepatitis dataset.
منابع مشابه
A prediction distribution of atmospheric pollutants using support vector machines, discriminant analysis and mapping tools (Case study: Tunisia)
Monitoring and controlling air quality parameters form an important subject of atmospheric and environmental research today due to the health impacts caused by the different pollutants present in the urban areas. The support vector machine (SVM), as a supervised learning analysis method, is considered an effective statistical tool for the prediction and analysis of air quality. The work present...
متن کاملA prediction distribution of atmospheric pollutants using support vector machines, discriminant analysis and mapping tools (Case study: Tunisia)
Monitoring and controlling air quality parameters form an important subject of atmospheric and environmental research today due to the health impacts caused by the different pollutants present in the urban areas. The support vector machine (SVM), as a supervised learning analysis method, is considered an effective statistical tool for the prediction and analysis of air quality. The work present...
متن کاملکاربرد الگوریتمهای دادهکاوی در تفکیک منابع رسوبی حوزۀ آبخیز نوده گناباد
Introduction: Reduction of sediment supply requires the implementation of soil conservation and sediment control programs in the form of watershed management plans. Sediment control programs require identifying the relative importance of sediment sources, their quantitative ascription and identification of critical areas within the watersheds. The sediment source ascription is involves two...
متن کاملRelationships Between Support Vector Classifiers and Generalized Linear Discriminant Analysis on Support Vectors
The linear discriminant analysis based on the generalized singular value decomposition (LDA/GSVD) has recently been introduced to circumvents the nonsingularity restriction that occur in the classical LDA so that a dimension reducing transformation can be effectively obtained for undersampled problems. In this paper, relationships between support vector machines (SVMs) and the generalized linea...
متن کاملA comparative study of performance of K-nearest neighbors and support vector machines for classification of groundwater
The aim of this work is to examine the feasibilities of the support vector machines (SVMs) and K-nearest neighbor (K-NN) classifier methods for the classification of an aquifer in the Khuzestan Province, Iran. For this purpose, 17 groundwater quality variables including EC, TDS, turbidity, pH, total hardness, Ca, Mg, total alkalinity, sulfate, nitrate, nitrite, fluoride, phosphate, Fe, Mn, Cu, ...
متن کامل